Understanding the errors of SHAPE-directed RNA structure modeling.
Abstract
Single-nucleotide-resolution chemical mapping for structured RNA is being rapidly advanced by new chemistries, faster readouts, and coupling to computational algorithms. Recent tests have shown that selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE) can give near-zero error rates (0-2%) in modeling the helices of RNA secondary structure. Here, we benchmark the method using six molecules for which crystallographic data are available: tRNA(phe) and 5S rRNA from Escherichia coli, the P4-P6 domain of the Tetrahymena group I ribozyme, and ligand-bound domains from riboswitches for adenine, cyclic di-GMP, and glycine. SHAPE-directed modeling of these highly structured RNAs gave an overall false negative rate (FNR) of 17% and a false discovery rate (FDR) of 21%, with at least one helix prediction error in five of the six cases. Extensive variations of data processing, normalization, and modeling parameters did not significantly mitigate modeling errors. Only one variation, filtering out data collected with deoxyinosine triphosphate during primer extension, gave a modest improvement (FNR = 12% and FDR = 14%). The residual structure modeling errors are explained by the insufficient information content of these RNAs' SHAPE data, as evaluated by a nonparametric bootstrapping analysis. Beyond these benchmark cases, bootstrapping suggests a low level of confidence (<50%) in the majority of helices in a previously proposed SHAPE-directed model for the HIV-1 RNA genome. Thus, SHAPE-directed RNA modeling is not always unambiguous, and helix-by-helix confidence estimates, as described herein, may be critical for interpreting results from this powerful methodology.
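The helix-level metrics named in the abstract can be illustrated with a minimal sketch. It assumes helices are compared by exact label match, which is a simplification: the abstract does not state the matching criterion actually used. The function names `helix_error_rates` and `bootstrap_support`, and the toy helix labels, are hypothetical and only serve the illustration. The bootstrap part assumes that a structure model has already been computed for each resampled data set; it only tallies how often each helix recurs.

```python
from collections import Counter


def helix_error_rates(reference, predicted):
    """Return (FNR, FDR) for a predicted helix set against a reference set.

    FNR = missed reference helices / all reference helices.
    FDR = predicted helices absent from the reference / all predicted helices.
    Assumption: two helices match only if their labels are identical.
    """
    reference, predicted = set(reference), set(predicted)
    tp = len(reference & predicted)   # helices found in both sets
    fn = len(reference - predicted)   # reference helices that were missed
    fp = len(predicted - reference)   # predicted helices not in the reference
    fnr = fn / (tp + fn) if reference else 0.0
    fdr = fp / (tp + fp) if predicted else 0.0
    return fnr, fdr


def bootstrap_support(replicate_models):
    """Fraction of bootstrap replicate models in which each helix appears."""
    counts = Counter(h for model in replicate_models for h in set(model))
    n = len(replicate_models)
    return {helix: count / n for helix, count in counts.items()}


# Toy data (hypothetical labels, not taken from the paper).
reference = {"P1", "P2", "P3", "P4"}
predicted = {"P1", "P2", "P3", "X1"}
fnr, fdr = helix_error_rates(reference, predicted)

# Each set stands for the helices modeled from one resampled data set.
replicates = [{"P1", "P2"}, {"P1", "X1"}, {"P1", "P2"}, {"P1"}]
support = bootstrap_support(replicates)
```

In this toy case one of four reference helices is missed and one of four predicted helices is spurious, so both rates are 0.25. A helix whose support falls below 0.5 would correspond to the low-confidence (<50%) category mentioned in the abstract.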
Related articles
Statistical analysis of SHAPE-directed RNA secondary structure modeling.
The ability to predict RNA secondary structure is fundamental for understanding and manipulating RNA function. The information obtained from selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE) experiments greatly improves the accuracy of RNA secondary structure prediction. Recently, Das and colleagues [Kladwang, W., et al. (2011) Biochemistry 50, 8049-8056] proposed a "bootstra...
Principles for understanding the accuracy of SHAPE-directed RNA structure modeling.
Accurate RNA structure modeling is an important, incompletely solved, challenge. Single-nucleotide resolution SHAPE (selective 2'-hydroxyl acylation analyzed by primer extension) yields an experimental measurement of local nucleotide flexibility that can be incorporated as pseudo-free energy change constraints to direct secondary structure predictions. Prior work from our laboratory has emphasi...
Relation Between RNA Sequences, Structures, and Shapes via Variation Networks
Background: RNA plays a key role in many aspects of biological processes and its tertiary structure is critical for its biological function. RNA secondary structure represents various significant portions of RNA tertiary structure. Since the biological function of RNA is concluded indirectly from its primary structure, it would be important to analyze the relations between the RNA sequences and t...
Nonlinear Analysis of a Flexible Beam Actuated by a Couple of Active SMA Wire Actuators
There are two different ways of using SMA wires as actuators for shape control of flexible structures; which can be either embedded within the composite laminate or externally attached to the structure. Since the actuator can be placed at different offset distances from the beam, external actuators produce more bending moment and, consequently, considerable shape changes with the same magnitude...
In silico Analysis and Molecular Modeling of RNA Polymerase, Sigma S (RpoS) Protein in Pseudomonas aeruginosa PAO1
Background: Sigma factors are proteins that regulate transcription in bacteria. Sigma factors can be activated in response to different environmental conditions. The rpoS (RNA polymerase, sigma S) gene encodes sigma-38 (σ38, or RpoS), a 37.8 kDa protein in Pseudomonas aeruginosa (P. aeruginosa) strains. RpoS is a central regulator of the general stress response and operates in both retroa...
Journal:
- Biochemistry
Volume 50, Issue 37
Pages: -
Publication date: 2011